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1, INTRODUCTION 

Since the foundation of iris recognition devised by Daugman [1], it is then categorized under the 
most reliable biometrics exist [2-4]. This 1s supported by Wang and Han [5] whose stated 3 traits of iris 
biometrics; Firstly, iris is an internal body which protected, secondly, it cannot easily being altered without 
damaging its vision and lastly the dilation due to illumination response makes it hard to imitate. Although 
being considered as reliable, there are still open issues in iris recognition. Among them, one that could be 
how the recognition reacts towards the wearing of contact lens, whether it is soft or cosmetic lens. This issue 
has been thoroughly studied in [6] and [7], that apart from cosmetic lens, soft lens can leave significant 
degradation during verification. From their findings, the false reject rate for matching the same subject, by 
considering with soft lens as probe image and without soft lens as gallery image has resulted in 5.66% FRR. 
As a comparison, matching without lens and soft lens for both gallery and sensor images only resulted in 
1.17% and 1.67% respectively. 

To handle the emerging of 125 million contact lens wearers [8], it is crucial that one recognition 
system should have an early stage mechanism to detect the presence of contact lens [9], and what type the 
lens is. It 1s also statistically proven that a recognition system performance degrades while recognizing 
contact lens subjects [10], [11]. Unlike cosmetic lens, soft lens is worn to correct eye vision rather than for 
appearance purpose. Soft lens is usually colourless, while cosmetic lens may appear in wide variety of 
colours. Besides, soft lens is imperceptible unless been inspected carefully. Hence, detection of soft lens 
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would be a non-trivial task that requires a mathematical and statistical approach to automating the task. The 
presence of soft lens can be noticed for one distinct feature, which is the lens boundary located on top of the 
sclera region. We may assume this boundary can be easily detected, however, it comprises of very thin line 
and can be easily confused by noise resulted from inconsistent illumination. In this work, we take the 
challenge to segment the lens boundary and use it as features, then, train it using Support Vector Machine, in 
order to classify between with or without soft lens. The novelty of our work is the fusion of extracted features 
produces from two prominent descriptors, Histogram of Gradient (HOG) and Scale Invariant Feature 
Transform (SIFT). The details of our work are represented in the following sections. In Section 2, any 
previous works regarding contact lens detection are discussed. In Section 3, our proposed method is 
described in details. The experimental results and discussions are reported in Section 4 and lastly, Section 5 
draws the conclusion. 


2. RELATED WORKS 

During decades, cosmetic lens has gained a lot of attention in iris recognition community. A lot of 
research has been conducted extensively mainly under the subject of fake iris detection and iris spoofing. It 
was pioneered by Daugman [12] who managed to detect dot matrix cosmetic lens using Fourier transform. 
Then, Lee et al. [13] introduced the use of Purkinje image to detect fake iris. The research continues where 
He et al. [12] utilized gray-level co-occurrence matrix (GLCM) as feature descriptor and SVM as classifier. 
They reported 100% accuracy tested on self-database. Meanwhile, Wei et al. [15] proposed three methods for 
contact lens detection; iris edge sharpness, iris-textons and GLCM with SVM. Evaluation is done by using 
CASIA and BATH database with accuracy achieved above 76.8%. Zhang et al. [16] used SIFT-weighted 
Local Binary Pattern with SVM as classifier. They achieved 99% accuracy using self-database. Unlike 
cosmetic lens, soft lens has gained less attention in the community. This is due to the believe that soft lens 
wearing does not cause significant impact of degradation during wearing as supported in [17-19]. However, 
the awareness of the soft lens’s wearing impact in [6], [20] has ignited more researches being conducted. 

There are three approaches to soft lens detection, whether it is hardware, machine learning or image 
segmentation approach. Hardware approach requires the use of sophisticated camera. Such examples are 
from Kywe et al. [21] where they used a thermal camera to measure the decrement of temperature on the eye 
surface during the blinking of the eye. They observed that a certain degree of decrement indicates the 
wearing of soft lens. Another work by Lee et al. [22] claimed that Purkinje images between original and lens 
worn iris are difference. These images are captured using two collimated IR-LED cameras. Recent work by 
Hughes and Bowyer [23] detect the presence of lens by using stereo vision from two cameras. Soft lens 
wearing is detected if the captured image seen as curved surface rather than flat (without lens). 

Meanwhile, machine learning requires features descriptor and classifier to perform. Doyle et al. [9] 
used a modified Local Binary Pattern (LBP) as features descriptor and experimented with 14 different 
classifiers. They achieved 96.5% of correct classification for cosmetic lens. However, only 50.25% correct 
classification for soft lens. Kohli et al. in [24] experimented with four methods; iris edge sharpness, textural 
features based on co-occurrence matrix, gray level co-occurrence matrix and LBP with SVM. They reported 
that LBP with SVM has inferred the best result in overall. However, only 54.8% CCR achieved for soft lens. 
Later, Yadav et al. [7] extend the work in [9] and [24] with additional database and revised algorithm. They 
achieved CCR above 45.35%. Gragnaniello et al. in [25], [26] used sclera and iris region as features and 
applied Scale Invariant Descriptor as feature descriptor with SVM as classifier. They reported CCR above 
76.29%. Raghavendra et al. [27] proposed the use of Binarized Statistical Image Features (BSIF) with SVM. 
They achieved 62% for intra-sensor and 54% accuracy for inter-sensor. Silva et al. [29] used Convolutional 
Neural Network. They resulted in 65% for intra-sensor and 42.25% for inter-sensor. 

On the other hand, image segmentation approach uses edge detection technique to segment the thin 
lens boundary located on top of the sclera region. As to date, Erdogan and Ross [29] are the pioneer to 
implement this approach. They proposed a clustering based edge detection to segment the outer lens 
boundary which has achieved an accuracy of 70%. From the literature, it can be summarized that the 
hardware approach relies on the physical characteristic of the eye. Machine learning approach instead 
involves the discrimination between two templates [30] while image segmentation deals with pixels 
manipulation. 


3. PROPOSED METHOD 

In this work, we proposed a fusion of image segmentation and machine learning approach to classify 
two class problems of validating the presence of with or without soft lens. These approaches are chosen as 
machine learning has the ability to extract and learn from training data to infer decision for a new data as well 
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as image segmentation approach that robust to lens type and sensors. Careful inspection shows that the 
wearing of soft lens will leave an obscure or darker edge line on top of the sclera region as shown in 
Figure 1. It has been thoroughly studied in [25] that the sclera region define the best sample to prove the 
presence of soft lens. Therefore, our focus is to extract information on this region by locating the pixels of 
lens boundary and transform it into feature space by mean of a feature descriptor. These features are then to 
be trained by a statistical learning approach in order to be classified. The next sub-section presents the main 
three stages of the proposed method which are segmentation, features extraction and classification. The step 
by step processes of this method is depicted in Figure 2. 


Soft lens boundary _— 


Figure 1. Examples of soft lens boundary location as pointed by arrow 





3.1 Segmentation 

In the segmentation stage, the main objective is to locate the soft lens boundary which is deemed to 
reside on top of the sclera. It starts with segmentation of the left and right sclera region. We used the radius 
information provided with Notre Dame Cosmetic Contact Lens 2013 (NDCCL13) [31] to segment the iris 
boundary by the mean of Circular Hough Transform. Then, we extend the iris radius with additional 30 
pixels to include the sclera region concentrically. The right-side region is segmented between the upper value 
of 150 to lower value of 210 degree rotation and 30 to -30 degree rotation respectively at the left-side. 
Therefore, the segmentation process will produce two segmented images of left and right sclera. These 
images are then to be normalized to ensure equal dimension for every iteration. In this case, we applied 
Daugman’s rubber sheet normalization method [1]. The normalized images will have the size of 30 x 240 
pixels. All these steps are illustrated in Figure 3. Next, any presence of eyelash is removed using inpainting 
algorithm [32]. This is done by replacing the eyelash pixels with the mean of surrounding pixels. We 
observed that either one of the left or right normalized image will have a slightly brighter or darker lens 
boundary. This is due to the inconsistent illumination caused by poor flash lighting throughout the enrollment 
process. In order to overcome this, we proposed a method called summed-histogram, where the frequencies 
for each pixel’s intensity value in the whole image is summed. Whenever the sum value is greater, the 
normalized image is considered to have brighter lens boundary compared to the other. The summed- 
histogram of individual normalized image, S is represented as follows: 


255 
S1(r,8) = | _ » Prr0n + Fic @)nsa + P18 )ne-- (Eq.1) 
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Where S is the summed-histogram of normalized image I and P; is the pixel’s intensity in the corresponding 
normalised polar coordinates (7,@).The value of S will be the preliminary input for the next segmentation 
process. In this process, the ridge detection algorithm is employed to segment the lens boundary on the 
normalized image. Among others segmentation algorithm, ridge detection has the ability to detect edge while 
sharing the same background colour but differ in intensities, which resembles the lens boundary [33]. The 
output from the ridge detection algorithm is the segmented lens boundary in a form of binary image where 
lens boundary will be regarded as white (1) and the background as black (0). For without soft lens images, a 
blank image of black coloured will be generated. 
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Figure 3. The sclera segmentation and image normalization process 


3.2 Feature Extraction 

Under feature extraction stage, two input images are required; segmented lens boundary image and 
normalized image of sclera region. The segmented lens boundary image is extracted using a window based 
feature descriptor; Histogram of Gradient [34]. HOG is chosen as the segmented lens boundary has the 
properties of the edge orientation. By this mean, for every 1 x n pixel in the image, the frequency histogram 
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of edge orientation is computed. Then, the resulted edge orientation is quantized into b bins. In our 
experiment, n and Db are set to 4 and 9 respectively. From our observation, the gradient of lens boundary is 
prone to appear in horizontal scale, which 170 to 190 degree rotation. Meanwhile, the normalized image of 
the sclera region is descripted using Scale Invariant Feature Transform (SIFT) [35]. The utilization of SIFT is 
to adapt the properties of having different scales, rotation and inhomogeneous shape of lens boundary, 
without degrading the performance of detection. There are four main steps of SIFT; scale-space 
representation, keypoints detection, keypoints orientation and generate keypoint descriptor. During scale- 
space representation, the Gaussian blurring is applied to the whole image with the number of octave is 4 and 
level per octave are set to 3.Then, Difference of Gaussian (DoG) of the blurred image is obtained by 
subtracting subsequent scales in each octave, producing multiple image point of D(x, v,a). By this mean, a 
keypoint is detected when its value is smaller (local minimum) or larger (local maximum) than the 
surrounding point. Poorly localized points are excluded during keypoints detection. The next process is to 
assign an orientation to each keypoint by calculating its gradient directions and magnitudes in a region of 16 
x 16. Finally, these regions are broken into sixteen 4 x 4 window and accumulates them into 8 bins histogram 
with a weighted value of gradient magnitude. Therefore, there is 4 x 4 x 8 = 128 feature vectors for each 
keypoint. In this work, our interest is to use the description of edge orientation of HOG and the invariance 
edge properties through SIFT to form a distinguishable feature of without and with soft lens. This is done by 
concatenating both left and right features of HOG and SIFT for the respective iris image into a feature vector. 


3.3 Classification 

The concatenated features are trained using a non-linear SVM with radius basis function kernel. 10 
fold cross validation were utilized to obtain the fittest parameters of C and gamma in order to reduce 
overfitting. In order to classify between without or with soft lens, the images of without lens are also applied 
to the whole process from segmentation to feature extraction. Images with prior knowledge of soft lens 
presence will be labelled as positive samples and without lens images as negative samples. 


4. RESULTS AND ANALYSIS 

In this work, the iris images are retrieved from Notre Dame Cosmetic Contact Lens 2013 
(NDCCL13) [31] that contains grayscale iris images of without lens, with soft lens and with cosmetic lens. 
All images are captured using either LG4000 or AD100 NIR camera. For LG4000, all images are split into 
3000 for training and 1200 for testing while for AD100, 600 images are for training and 300 for testing. 
Table 1 shows the images distribution for NDCCL13 database. 


Table 1. NDCCL13 images class distribution 


Camera Class Label Training Testing 
No N 1000 400 
LG4000 Soft S 1000 400 
Cosmetic T 1000 400 
No N 200 100 
AD100 Soft S 200 100 
Cosmetic T 200 100 


All experiments are executed using Matlab R2017b on a machine with 2.3GHz and 6GB memory. 
We calculated confusion matrix for each class of different cameras. For uniformity with [7], we only reported 
correct classification rate (CCR) with its average. However, the evaluation of cosmetic lens in not within our 
scope as this work only focusing on two class classification of with or without soft lens. We annotated the 
class of without lens as N and soft lens as S. N-N refers to the probability of without lens samples are 
classified belongs to without lens while S-S refers to the probability of soft lens samples are classified 
belongs to soft lens. Comparisons are made with existing methods proposed in [7] by using classical LBP, 
[36] by using a modified version of LBP, [28] by using Convolutional Neural Network, [27] by using 
Binarized Statistical Image Features and [25] by using Scale Invariant Descriptor. During segmentation 
stage, each iris image will produce two segmented lens boundary images and two normalized images which 
derived from the left and right sclera. These images are used for training and testing throughout the 
classification stage. These samples of images are shown in Table 2. Therefore, for LG4000, there are 2000 
training images for without lens and another 2000 for soft lens. Meanwhile, for AD100, both without lens 
and soft lens constitute 400 training images each. The numbers of testing images are doubled up, the same 
manner as training images. 
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Table 2. NDCCL13 images class distribution 





Image Sclera Normalized image Segmented lens boundary 
04261d1016 = 
ee — ee 
06008458 | 


Table 3 reports the performance of the proposed method with the aforementioned state-of-the-art 
methods. The best results are written in bold. In summary, the proposed method has yielded the highest 
average for both cameras. However, the LG4000’s CCR for N-N classification has unable to surpass the 
method from [25]. In the meantime, the CCR for S-S classification resulted over 10 points from the nearest 
figure. It is observed that the results for AD100 are very marginal to achieve significant performance. We 
believe the less number of subjects is not sufficient to statistically generalize the variability between classes. 
Some of future works [37, 38] may be added for better performance in this research. 


Table 3. The correct classification rate and average of the proposed and state of the art methods 


Cameras Classification LBP [7] oe ee ui CNN [28] BSIF [27] ne sees 
N-N 70.00 81.25 85.50 84.50 76.50 95.75 85.63 
LG4000 S-S 60.15 65.41 45.25 7305 84.50 84.00 95.75 
Average 65.08 73.33 65.38 79.13 80.50 89.88 90.69 
N-N 42.00 42.00 81.00 73.00 79.50 79.00 80.20 
AD100 S-S 54.00 60.00 52.00 65.00 62.00 78.00 79.12 
Average 48 51 66.5 69 2005 78.5 79.66 


5. CONCLUSION 

This work proposed two class contact lens classification of with or without soft lens. In order to 
distinguish between these classes, we focused on extracting local information available on the sclera region. 
It is observed that the wearing of soft lens may leave thin lines which appear to be the lens boundary. 
Therefore, we proposed a segmentation mechanism to segment this boundary and utilize Histogram of 
Gradient to extract the gradient orientation of the boundary. We also descript the original sclera region with 
Scale Invariant Feature Transform in order to tackle the inhomogeneous shape and rotation of the lens 
boundary. These features are concatenated before trained and classified using a non-linear SVM. Results 
showed that the proposed method achieved the highest average accuracy compared to other state-of-the-art 
methods. 
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